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Abstract 

Social networks are often represented as directed graphs where the nodes 
are individuals and the edges indicate a form of social relationship. A sim- 
ple way to model the diffusion of ideas, innovative behavior, or "word- 
of-mouth" effects on such a graph is to consider an increasing process of 
"infected" (or active) nodes: each node becomes infected once an activa- 
tion function of the set of its infected neighbors crosses a certain thresh- 
old value. Such a model was introduced by Kempe, Kleinberg, and Tardos 
(KKT) in IIKKT031 IKKT05il where the authors also impose several natural 
assumptions: the threshold values are random and the activation functions 
are monotone and submodular. The monotonicity condition indicates that 
a node is more likely to become active if more of its neighbors are active, 
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while the submodularity condition indicates that the marginal effect of each 
neighbor is decreasing when the set of active neighbors increases. 

For an initial set of active nodes S, let (j{S) denote the expected num- 
ber of active nodes at termination. Here we prove a conjecture of KKT: we 
show that the function (j{S) is submodular under the assumptions above. 
We prove the same result for the expected value of any monotone, submod- 
ular function of the set of active nodes at termination. Roughly, our results 
demonstrate that "local" submodularity is preserved "globally" under this 
diffusion process. This is of natural computational interest, as many op- 
timization problems have good approximation algorithms for submodular 
functions. 



1 Introduction 

Social Networks. In recent years, diffusion processes on social networks have 
been the focus of intense study. While traditionally such processes have been 
of major interest in epidemiology where they model the spread of diseases and 
immunization [|Mor04[ |Lig85[ |Lig99i IDur88l IBBCSOSi IDJ061 . much of the re- 



cent interest has resulted from applications in sociology, economics, and engi- 
neering [|BR871 lARCVOn IGLMO Ibl IGLMO 1 ai iDROTl lRD02l IKKT03 [ iKKTOSll . 

In computer science, a strong motivation for analyzing diffusion processes 
has recently emanated from the study of viral marketing strategies in data mining, 
where various novel algorithmic problems have been considered HDROli IRD02[ 
IKKT031 [KKT051 ■ Roughly speaking, viral marketing — unlike conventional mar- 
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keting — takes into account the "network value" of potential customers, that is, it 
seeks to target a set of individuals whose influence on their social network through 
word-of-mouth effects is high. 

Commonly-used heuristics to identify influential nodes in social networks in- 
clude picking individuals of high degree — so-called degree centrality heuristics — 
or picking individuals with short average distance to the rest of the network — so- 
called distance centrality heuristics [iWF94|| . Typically these heuristics provide 
no guarantees. Here we prove a structural conjecture of Kempe, Kleinberg, and 
Tardos (KKT) [|KKT03[ IKKT05II which in turn implies that a natural greedy al- 
gorithm has a good performance guarantee. The conjecture can be roughly stated 
as follows: if a diffusion model is locally submodular, that is, the influence on an 
individual by its neighbors in the network has "diminishing returns," then the pro- 
cess is globally submodular (on average). This is relevant in this context because, 
under the submodularity property, optimization problems — such as the viral mar- 
keting problem — are known to have good approximation algorithms I1NW88L Li 
particular, in HKKTOBll . greedy algorithms based on the above conjecture were 
shown to achieve significantly better performances in practice than widely-used 
network analysis heuristics. 

General Threshold Model. In [IKKT03B . KKT introduced the general thresh- 
old model, a broad generalization of a variety of natural diffusion models on net- 
works, including the influential linear threshold model of Granovetter in sociol- 
ogy [|Gra78ll . Given an initial set of infected or active individuals on a network, 
the process grows in the following way. (See Section [TT| for a formal description.) 
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Each individual, say v, has an activation function, which measures the effect of its 
neighbors on v, as well as a threshold value. At any time, if the set of previously 
infected neighbors of v is such that its activation function crosses its threshold 
value, then v becomes infected. This process is progressive — an active node stays 
active forever. KKT consider the following natural assumptions: 

- The threshold values are random. This is to account for our lack of knowl- 
edge of the exact threshold values. KKT actually assume that the thresholds 
are uniformly random. Note however that, given that any distribution can be 
generated from a uniform random variable, it follows that, by appropriately 
modifying the activation functions, the threshold values can effectively have 
any distribution. (See Section |3j) 

- The activation functions are monotone increasing. That is, a node is more 
likely to become infected if a larger set of its neighbors is infected. 

- The activation functions are submodular. This corresponds to the fact that 
the marginal effect of each neighbor of v decreases as the set of active nodes 
increases. 

The Influence Maximization Problem. Since the diffusion process defined 
above is increasing, it terminates after a finite number of steps. For a given initial 
set of active nodes S we define a^S) to be the expected size of the set of active 
nodes at the end of the process. In the Influence Maximization Problem, we aim 
to find a set 5 of a fixed size maximizing o-(S). 
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The Influence Maximization Problem is a natural problem to consider in the 
context of viral marketing. Given a social network, it is desired to find a small 
set of "target" individuals so as to maximize the number of customers who will 
eventually purchase a product following the effects of "word-of-mouth" HDROlj 
IRD02II . The same problem may also be of interest in epidemiology where finding 
the set of a fixed size maximizing cr(S) is a natural problem both in terms 
of bounding the spread of a disease and in terms of maximizing the effect of 
immunization. 

M [IKKT03II it was shown that the Influence Maximization Problem is NP- 
hard to approximate within a factor 1 — l/e + e for all £ > and that the problem 
is in fact n^""^ hard to approximate without the submodularity condition. (See, 



e.g., [IGJ791 |ACG"'"99ll for background on NP-completeness and hardness of ap- 
proximation.) On the other hand, it was shown in IIKKT05II that for all e > it is 
possible to find a set S of fixed size that isa(l — 1/e — e) -approximation of the 
maximum in random polynomial time if the set function a is itself submodular. 
This leads to the following conjecture. 

Conjecture 1 ( ||KKT03i IKKT051I ) The function a is submodular. 

While the result of [IKKT031 IKKT05II showed that a is submodular in special 
cases and related models (see below), the general case was open prior to our work. 
In this paper we prove Conjecture [T] and extend it to the case where a{S) is the 
expected value of any monotone, submodular function of the final active set. This 
gives a (1 — 1/e — e) -approximation algorithm for finding a set S of fixed size 



maximizing (t{S). 

1.1 The Model 

In this section, we define formally the general threshold model. 
Definition 1 (Social Network) A social network is given by: 

- A ground set V with \V\ — n 

- A collection o/ activation functions — {fv)vev, where : 2^ — > [0, 1] is 
a [0, l]-valued set function on V. 

Typically, we think of V as the individuals of a social network G = {V, E) where 
each fv measures the effect of v's neighbors N{v) on v. In particular depends 
only on neighbors N{v) affecting v, so fv{S) — fy{N{v) fi S) for all -S". However, 
the specification of the graph will not be needed below. 

Definition 2 (Monotonicity) The function 
/ : 2^ — > M monotone if 

fv(S) < f,{T) 

for all 

SCTCV. 

We say that a collection T of functions from 2^ to R is monotone if all its elements 
are monotone functions. 
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Definition 3 (Submodularity) The function 
f ^Ris submodular if for all S,T CV 

f{S) + f{T)>f{SnT) + f{SUT). 

We say that a collection T of functions from 2^ to R is submodular if all its 
elements are submodular functions. 

The monotonicity condition corresponds to the fact that the effect of a larger set 
on V is stronger than the effect of a smaller set. The submodularity condition is 
equivalent to the fact that if 5" C T and v & V then: 

f{TU{v])-f{T)<f{Syj{v])-f{Sl 

so the effect of each individual is decreasing when the set increases. 

Assumption 1 Throughout, we assume that fv{^) — and that fy is monotone 
and submodular for all v &V. 

We will consider the following process, which is often referred to as a "diffu- 
sion" process in the sociology literature. 

Definition 4 (Diffusion) For a given T, consider the following process, denoted 
S = (^^)"^o^ started at SQ V: 

1. Associate to each node v an independent random variable 9y uniform in 

[0, 1] ; 
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2. Set So = S; 

3. At time t > 1, initialize St = St-i and add to St the set of nodes in V \ St^i 
such that fy{St-i) > 9^ 

Clearly the process stops on or before time n — 1. We denote by Qj^{S) the 
distribution ofS when started at S and write S ~ Qj^{S), where we will drop the 
subscript when T is clear from the context. 

Definition 5 (Influence) For a weight function w : 2^ ^ M+, we define the 
influence cr^i^S) of S V as 

a^(S') = Es[w{Sn-i)], 
where E5 is the expectation under Qjr(S'). 

1.2 Previous Results 

Conjecture [T] was previously verified in several special cases and related models. 

Linear Threshold Model IIKKT03L This is the general threshold model with 
/t, of the form 

fv{s) = by^w, 

for nonnegative constants 6^, The proof uses a representation in terms of a 
related percolation model. See flKKTOBH for details. 

"Normalized" Submodular Threshold Model IIKKTO50 . This is the gen- 
eral threshold model with satisfying the so-called "normalized" submodularity 
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property: 



US u Q}) - US) ^ UT u Q}) - UT) 
I -US) - i-UT) 



(1) 



for all S C T. Note that this is stronger than submodularity. The proof takes 
advantage of an equivalence with the decreasing cascade model (see below). 

Independent Cascade Model IIKKT03L This is a related model where each 
edge (t>, w) has an associated probability p^,^ of being live, independently of all 
other edges. Infected nodes are those connected to the initial set through a live 
path. The proof of Conjecture[T]in this case also uses a percolation argument. 

Decreasing Cascade Model HKKT05L A natural generalization of the previ- 
ous model consists in defining for each v, each neighbor w oiv and each subset of 
neighbors 5 of a success probability Pv{w, S) which is the probability that node 
w will succeed in activating v given that nodes in S are active and have failed to 
activate v. Each node w gets only one chance to activate each of its neighbors. 
KKT impose a natural order-independence condition on the success probabilities, 
that is, the overall success probability of activating v does not depend on the or- 
der in which the active neighbors of v try to activate it. This model — called the 
general cascade model in IIKKT03II — turns out to be equivalent to the general 
threshold model under the maps 



Vviw, S) 
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and 

r 

where S = {wi, . . . , Wr} and Si = {wi, . . . , Wi}. When 

Pt,(ti^, 5') > Pv{w,T) (2) 

for all S C T and all v, w, the model is called the decreasing cascade model. 

It is easy to check that the decreasing cascade model is equivalent to ([T]) under 
the mapping above. The proof of the conjecture for the decreasing cascade model 
works by coupling the processes started at S and T with S C T and then adding 
w in a second phase where condition ([2]) is used. 

Li IIKKT03II . it is also shown that these results carry over to the non-progressive 
case where 9^ is resampled independently at each time step and to general mar- 
keting strategies where one can use several marketing actions simultaneously. We 
refer the reader to [IKKT03II for details. 

Contact Process. A result similar to our Theorem [T] below holds for a related 
contact process llHar74[ Lig85[ ILSS07II . where the infection rates are monotone, 



concave functions of the number of infected neighbours (a special case of mono- 
tone, submodular functions) and where vertices also heal at a constant rate. In 
particular, Harris [|Har74ll uses a coupling argument, although it appears that Har- 
ris' proof does not extend easily to our setting as it uses the special form of the 
infection rates. 

Subsequent Work. After the publication of this work in its extended abstract 

10 



form [IMR07II . further work was devoted to viral marketing, particularly in the 
competitive framework where firms compete for customers IIDGM06[ ICNWvZOTl 
IBKS07[lEDS07ll . 

1.3 Main Result 

Theorem 1 (Main Result) Consider the process defined in Definition^where T 
and w are monotone and submodular. Then, is monotone and submodular. In 
particular, this is true when w is the cardinality function. 

Corollary 1 Consider the process defined in Definition |?] where T and w are 
monotone and submodular. Furthermore, assume that w takes values between 1 
and^iohfin). Then, there exists a (greedy) (1 — 1/e — l/po\j{n)) -approximation 
algorithm for maximizing a^^S) among all sets S of size k l \KKT05\l in time 
poly(n). In particular, this is true when w is the cardinality function. 

The corollary follows from Theorem [1] and Theorem 2 of HKKTOSL KKT's 
Greedy Approximation Algorithm is a simple variant of the standard greedy algo- 
rithm where sampling is used to estimate cr^. 

Our proof. Similarly to HKKTOSll . a natural idea is to run the process in stages. 
Here we use three phases: we first grow An B, then A \ B, and finally B \ A. 
See Figure [T] for an illustration. The key difference is in the execution of the last 
phase. To do away with the "normalized" submodularity condition of HKKTOSL 
we do the following. 
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• We use a careful combination of cascade and threshold models, which we 
call the need-to-know representation. 

• More importantly, we introduce a novel "antisense" couphng technique 
based on the intuition that coupling the processes started at arbitrary sets 
A and B by using 9^ and 1 — 9y respectively, in a way, "maximizes their 
union" (note that 1 — 9^ is also uniform in [0, 1]). This has to be imple- 
mented carefully to also control the intersection. See Section [2] for details. 
See, e.g., llLin92|| for a general reference on the coupling method. 

The rest of the paper is organized as follows. We begin with a few prelimi- 



nary remarks in Section 2.1 The need-to-know representation and the antisense 



process are introduced in Section 2.2 The full coupling and the proof of its cor- 



rectness appear in Section 2.3 



The results of this paper were announced in the form of an extended abstract 
in IIMR07H . 

2 Proof 

Throughout we fix JF and w monotone, submodular. We also fix two arbitrary sets 
A, B C V andlet C = A n B and D = A U B . The idea of the proof is to couple 
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the four processes 



in such a way that 



and 



B = (i?t)r=o' ~ Q{B), 

c = (a)r=o' ~ 
D = (A)r=-o ~ Qp), 

Cn-l ^ n -B„-l, (3) 



Dn-1 C U (4) 

Indeed, we then have the following lemma. 

Lemma 1 Suppose there exists a coupling o/ A, B, C and D satisfying (|?]). 
Then 

(j^{A) + a^{B)>a^{Ar\B) + (j^{AVJB). (5) 
Proof: We have by monotonicity and submodularity of w 

> w{A^^i n + w{An-l U 

> w;(C„„i) + u;(D„_i), (6) 

and therefore, taking expectations we get ([5]). ■ 

Our coupling is based on the following ideas: 
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Antisense coupling. The obvious coupling is to use the same for all 
processes. It is easy to see that such a coupling does not satisfy (|4]). It does 
however satisfy ([3]). Intuitively, using the same 9y for A and B "maximizes 
their intersection" while using 9^ for A and (1 — 9,^) for B "maximizes 
their union." We call this last coupling the antisense coupling. To dominate 
both the intersection and the union simultaneously, we combine these two 
couplings. 

Piecemeal growth. The growth of the four processes can be divided in 
several stages where we add the initial sets progressively. Roughly, the 
coupling below starts by growing A D B, then A \ B and finally B \ A. 
Following our previous comment, the last phase uses the antisense coupling 
to allow the process B to dominate D in that phase. 

Need-to-know representation. Finally, to help carry out the previous re- 
marks, we note that it is not necessary to pick the 9y's at the beginning of the 
process. Instead, at each step, we uncover as little information as possible 



about 9y. This is related to the cascade model of HKKTOSII although here we 
use an explicit combination of cascade and threshold models. 



We explain these ideas next. The proof of Theorem [T] follows in Section 2.3 



2.1 Piecemeal growth 

We first describe an equivalent representation of the process where the initial set 
is added in stages. We denote by Q{S \ 9) the process Q{S) conditioned on = 
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{9y)vev- For a partition S^^\ S'^^^ of S (we allow some of the ^('^^'s to be 
empty), consider the process 

where 

1. For each v E V pick 9^ uniformly and independently in [0, 1] and set T„i = 

0; 

2. For 1 < k < K,we set 

in other words, we add the S'^^'^^'s one at a time and use the same 6'^'s for all 
stages. 

The outcomes of the processes 2(5") and Q{S'^^\ . . . , S'^^'^) have the same dis- 
tribution. This result actually follows from a more general discussion in IIKKT05II . 
but we give a proof here for completeness. 

Lemma 2 (Piecemeal Growth) Let S^^\ . . . , S^^^ be a partition ofSCV. Let 

s = (^t)r=o' ~ Q{si 

and 
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Then Sn^i and T^n-i have the same distribution. 

Proof: Pick 9^ uniformly and independently in [0, 1] for each v & V and let 

S^{St)Z^r^Q{S\9), 



and 



Moreover, let 



and 



T" = (TX=o'-Q{^.---As\e). 

By monotonicity and induction on the K stages, 
But clearly 

rpl rpll Q 

-'-Kn-1 — -'-Kn-1 — '-'n-1 

SO that ^^.i = Tku-I- ■ 
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2.2 Antisense phase and need-to-know representation 

To implement the antisense coupling, we define the following variant of the pro- 
cess. 

Definition 6 Let S^^\ S^^^ be a partition of S and letT QV\S. We define 
the process 

T = (TOjfo^'^"-' ~ Q-{S^'\ ^(^^ T), 

where 

1. For each v G V pick 6y uniformly in [0, 1]; 

2. Let T = {Tt)fX^ - . . . , 5^ | 9); 

3. Set Tku = Tku-i U T; 

4. At time Kn + 1 < t < {K + l)n — 1, initialize — and add to the 

set of nodes in V \ Tj_i such that 

fv{Tt-l) — fv{TKn-l) >^ — 0v 

Lemma 3 (Antisense Pliase) Assume S''^\ . . . , S^^^ is a partition of S and T C 
V\S.Let 

and 

T = (T,)Jfo^')"-' - Q-{S^^\ . . . ,5(^);T). 
17 



Then, S'(x+i)n-i cind T(K+i)n-i have the same distribution. 

Proof: As was discussed at the beginning of Section|2} rather than picking the O^'s 
at the beginning of the process, it is useful to think of them as being progressively 
uncovered on a need-to-know basis. Consider only first stage of the process 
S for the time being. Let S-i = 0. Suppose that, at time t > 1, v ^ St-i. Then 
we have that 9^ E [fviSt-2), 1] and all we need to know to decide if v is added to 
St is whether or not 6',,, G [fv{St~2), fviSt^i)]. In other words, was the increase 
in fy between time t — 2 and t — 1 enough to hit 9^1 Note that, given the event 
{/t;('S't-2) < dy}, dy IS uuiformly distributed in [fviSt-2), 1] and we have that 9^ 
is in [fy{St-2), fviSt-i)] with probabihty 

fv{St-l) — fv{St~2) 

1 - ust-2) ■ 

Therefore, we can describe the process {St)^!^ equivalently as follows. We first 

set S^i = ^, Sq = S. Then, at step l<t<ri — l,we initialize St = St^i and for 
each V eV \ St-i- 

- With probabihty 

fv{St-l) — fviSt-2) 

l-USt-2) ' 
we add v to St and pick 9^ uniformly in 

[MSt-2)JyiSt-l)]; 



Otherwise, we do nothing. 
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By the discussion above, this new version of the process has the same distribution 
as Q{S^^^). We proceed similarly for the following K — 1 stages to get {St)^J^Q~^ 
which is then distributed according to Q{S^^\ . . . , 5'*^^^). 

Up to time Kn — 1, the processes (St) and (Tt) have identical transition prob- 
abilities. Hence, we can take 

Then note that, at time t — Kn, for each v ^ Sku-i — Tku-u we have that 9y is 
uniformly distributed in 

[fv{SKn-l), 1] = [fv{TKn-l), 1] ■ 

For each such v, we now pick 9^ uniformly in [fy(SKn-i): !]• Moreover, we define 
for aWv 

, \ (^v, V e Sku-I, 

I fv{SKn-l) + 1 — ^u, V ^ Sku-I- 

Finally, let 
and 

(T,)Sf+i^"-'~Q(7V„-iUT|^')- 

That is, we run the last stage of S and T as before, with 9 and 9' respectively. 
It is clear that T ~ . . . , S^^'>;T) by construction. Moreover, it follows 
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that S(^K+i)n-i and T(^K+i)n-i have the same distribution from the fact that for a 
uniform variable 9^ in [fviSKn-i), 1]» the random variables 6'^ and fv{SKn-i) + 
\ — 6v have the same distribution. ■ 

2.3 Coupling 

We are now ready to prove Theorem [1] We will need the following easy conse- 
quence of monotone submodularity. 

Lemma 4 Let / : 2^ — > ]R_,. be monotone and submodular. Then if S <^ S' <^ V 
and T C T' ^ V, we have 

fiSUT')-f{S)>fiS'UT)-f{S'). 

Proof: Note that by monotonicity and submodularity 

f{SUT')-f{S) > f{SUT)-f{S) = f{SU{T\S))-f{S) 

> f{S U {S' \ (T U S)) U (T \ S)) - f{S U {S' \ (T U S))) 

> fiS'UT)-f{S'). 



We now give a proof of Theorem [T| 
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Proof: We proceed with our coupling of A, B, C, and D. In fact, by Lemmas [T] 
|2} and[3| it suffices instead to couple 



C = (a)?:o'~2(^n 5,0,0), 

D = (A)?:o ' r^Q-{AnB,A\B-B\ A), 

in such a way that for all < t < — 1 

Ct C At n Bt, Dt<^AtU Bt. (8) 

Our coupling is as follows. We pick 0^ uniformly in [0, 1] for all v eV and use the 
same for all four processes above. See Figure[T]for a graphical representation of 
the proof. 

By construction, for all < t < 2n — 1 we have 

Bt = Ct^ At 

so that 

Ct = Atn Bt. 

Similarly for all < t < 2n — 1 we have Dt = At so that 

DtCAtU Bt. 
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Phase AnB 



Phase A\B 



Ao = An B ^ A„ 




B 



Bn = An B ^ Bn 



Bn-i U -» B2„ 



i 



Co = An B ^ C„ 



Cn-1 U ^ C2„ 



I 



D 



Do = An B ^ D„_i 



U (A\ S) ^ -D2n-1 




Phase B \ ^ 



U (A\ S) ^ A2n-1 A2„_i U ^ A3„_i 




B2„-l U(B\A) ^ B3„-l 




C2„-l U -» C3„_i 

A3„_i n -B3„_l 



D2n-1 U (S \ A) ^ i33. 




A3„_i U S3,i-1 












1 


1.' > > 1 


1 




1 




1 




1 




1 




1 







Figure 1 : The three phases of the coupling. In each phase, the dark shaded region 
is the initial set, while the light shaded region is the final set. The sets A and B are 
indicated by dashed lines. The thick dashed lines show that the desired properties 
are satisfied. 
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Thus ^ is satisfied for < t < 2n - 1. To see ([8]) holds also for 2n <t <3n-l, 
note that by Lemma |4] for all v ^ 

fv{.B2n) — fv{.B2n-l) > fv{D2n) — fv{.D2n-l), 

since B2n-i C £'2^-1, 

B2n = B2n-lU{B\A), 

and 

D2n = D2n-iy^{B\A). 

(See Figure|2]for an illustration of this step.) We proceed by induction. By mono- 
tonicity and Lemma |4} we then have for all 2n < t < 3n — 1 

A \ An-l C Bt \ fi2n-l, 
(^2,0 UBt) - UB2n-l) > fv{Dt) - /.(An-l), 

for all V ^ i52rf Indeed, assume (^^i,t') and (^^2,*') for all 2n < t' < t. We have 
already proved the base case t = 2n. Then (^2,1) implies i+i) by definition of 
the process in the antisense phase (Definition |6]). In turn, (fii t+i) implies (i^2,t+i) 
by monotonicity and Lemma |4} 

This proves the claim since we then have for all 2n < t < 3n — 1, At = D2n-i 

and 

Dt \ An-l C Bt 



Mb,,-,) 



(A) 
















0. 



0/v(^2j-/v(A„l) 



> 



/v(AJ-/v(A„l)0 



Figure 2: Last phase of the coupling seen from a node v. In the antisense phase, 
the interval is filled up "from the end." By submodularity, the process (Bt) reaches 
the threshold first. 



which implies 



The condition 



is clear from the construction. 



DtCAtU Bt. 



Ct<^Atn Bt 



3 Concluding Remarks 

Necessity. It is easy to see that the submodularity assumption in Theorem [T] is 
necessary in the following sense: Any function / which is not submodular ad- 
mits a network with activation function / where the influence is not submodular. 
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Indeed, let f ■.2^ ^R+,A,B CV such that 

f{A) + f{B)<f{AnB) + f{AUB). 

Let V* — V U {v*} with f^* = f and /y = 1 for all v eV. It is then immediate 
to check that: 

a{A) + a{B) = \A\ + \B\ + f{A) + f{B) 
= \Ar]B\ + \AuB\ 

+f{A) + m 

< \AnB\ + \AuB\ 

+f{AnB) + f{AUB) 
= a{AnB) + a{AUB). 

Other threshold distributions. As noted in the introduction, our results hold for 
general threshold distributions. Assume that 9y has cumulative distribution F„. 
Notice that the dynamics can be re- written as 

MSt-i) >e, = F-\u,) ^ FMSt-i)) > u„ 

where the U^'s are independent uniform in [0, 1]. Hence, if F„ o is increasing 
submodular for all v then the influence function is submodular. Furthermore, the 
example above demontrates that this condition is necessary. 
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